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Abstract 

We study corruption as a generalized epidemic process on the 
graph of social relationships. The main difference to classical epidemic 
processes is the strong nonlinear dependence of the transmission prob- 
ability on the local density of corruption and the mean field influence 
of the overall corruption in the society. Network clustering and the 
degree-degree correlation play an essential role in corruption dynam- 
ics. We discuss phase transitions, the influence of the graph structure 
and the implications for epidemic control. Structural and dynamical 
arguments are given why strongly hierarchically organized societies 
like systems with dictatorial tendency are more vulnerable to corrup- 
tion than democracies. A similar type of modelling can be applied 
to other social contagion spreading processes like opinion formation, 
doping usage, social disorders or innovation dynamics. 



1 Introduction 

Corruption seems to be an unavoidable part of human social interaction, 
prevalent in every society at any time since the very beginning of human his- 
tory till today. In sharp contrast to the high prevalence of corruption in many 
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countries and the rather large literature on political, social and economical 
aspects of corruption there is only a small number of attempts to model the 
dynamics of corruption in a mathematically quantified way. The modelling 
approach in these few attempts essentially follows two paths. The first is in 
the sense of microeconomics and incorporates game theoretic aspects (for a 
recent model in this direction see the book by Steinriicken |T7j and the ref- 
erences therein) or rules for maximizing a certain economically based profit 
functional (PH|^II|). Then a set of differential equations for the evolution of 
the mean corruption is derived and a stability analysis done on that basis. In 
these models one usually makes rather detailed assumptions about the under- 
lying organization structure on which the individuals interact. The second 
line of approach is more in the sense of cellular automata (CA) models with 
rather simple state variables and local interaction dynamics. For example in 
the article by Wirl 19 a simple 1-dimensional deterministic cellular lattice 
automata model is used to describe the propagation of corruption. Never- 
theless, as is well known in CA-modelling, the global dynamical picture can 
be highly complex and nontrivial. 

Up to now all these attempts did not take into account the complex 
network of social relationships as the underlying structure for the spread of 
corruption. In this article we will present a model for the spread of corruption 
on complex networks in the spirit of epidemiology. The model describes 
aspects of the evolution of corruption in a virtual population and incorporates 
some basic universal features of corruption. The local interaction dynamics 
of the model is similar to cellular automata but "lives" not on a lattice type 
graph like most of the CA-models but on complex networks. 

Considering corruption as a nonstandard epidemic process relies on the 
plausible assumption that corruption rarely emerges out of nothing but is 
usually related to some already corrupt environment which may "infect" sus- 
ceptibles. Of course the spontaneous decision of somebody to act corruptly is 
possible and could easy be handled in the model as an external weak source of 
infection. One of the very special features in corruption propagation which 
differs from what is used in describing classical epidemic processes is the 
threshold like dependence of the local transition probabilities. By this we 
mean that a noncorrupt individual gets infected with high probability if the 
number of corrupt individuals in the group of his direct social contacts (en- 
coded as the set of neighbors in a "friendship" or acquaintance graph) exceeds 
a certain threshold number. Otherwise if the number of corrupt individuals 
in somebodies social neighborhood is below that threshold value there is only 
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a small probability to get corrupt via such "local" interactions. The second 
main difference to classical epidemic processes is the mean field dependence 
of the corruption process. By this we mean that an individual can get cor- 
rupt just because there is a high prevalence (or believed prevalence) in the 
society even when there is no corruption in the local neighborhood. There 
is another interesting mean field term entering the game, namely the society 
strikes back to corruption with an efficiency proportional to the fraction of 
the noncorrupt people. Both mean field terms are nonlinear and together 
with the local propagation mechanisms they give rise to a rather complex 
dynamical picture. 

2 What is corruption? 

Corruption is a substructure of human social interaction. Common sense 
associates corruption mainly with a deviation from fair play interaction in the 
development of social relations. Clearly what is meant be fair play depends 
on the cultural context of a given population/society. This vague description 
of corruption is in the spirit of sociology and psychology and differs from the 
more narrow corruption concepts usually considered in economics or political 
sciences. There, corruption is mainly seen as a misuse of public power to gain 
profit in a more or less illegal way. In any case, corruption has many different 
faces in its concrete appearance and no single model approach will be able to 
describe the whole picture in an adequate way. But this does not at all imply 
that mathematical models are useless in this situation. They can provide a 
substantial improvement in our understanding of corruption as long as one 
clearly defines the aim and limitations of the taken approach. 

For the model approach developed in this article we will use the notion 
of corruption in the more general, first sense. More precisely our intention 
is to describe changes in mind ranging from damming of corruption as a 
criminal act to accepting corruption as an attractive option. Therefore in 
this paper we do not introduce the group of state representatives or officials 
since we assume that the essential changes in mind which allow corrupt acts 
happen long before an individual is in the position to act corruptly. Empir- 
ical investigations about motives and "typology" of corrupt actors (see [3] 
for results from case studies in Germany) have shown that the majority of 
individuals involved in corruption affairs are highly educated, well positioned 
with respect to social status and do not think to have done something wrong, 
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Figure 1: Corruption Perception index 2004 versus rank 



indicating the importance of mind changes prior to corrupt acts. 

There is a notorious problem in finding good empirical data which would 
allow to estimate the prevalence of corruption. Probably the greatest effort 
over the last years to measure the degree of corruption in various countries 
was made by "Transparency International" (TA), a non profit group of in- 
dividuals and organizations which are highly concerned by the lack of sound 
data. Since 1995 they publish a yearly corruption report and a so called 
Corruption Perception Index (CPI) ^H]- TA is well aware of the subjectivity 
in peoples perception of corruption but hopes that the large number of cases 
involved in the CPI averages out most of the bias. Figure Ogives a CPI-rank 
plot of the 2004 date from TA. Note that a value of 10 for the CPI corre- 
sponds to the absence of corruption. For 2004 Finland holds the top ranking 
and Germany is on place 15 with an index of 8.2. 

It is not our aim to explain the values of the CPI or other corruption 
data sets, since this would require a semirealistic modelling of the social and 
economical structure of individual countries which is completely illusionary 
at the present stage of research. Rather we want to demonstrate which 
scenarios are dynamically possible and whether there are phase transitions. 
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3 Corruption as a generalized epidemic pro- 



cess 

In this section we first describe the basic setting for our model structure. 
Refinements and more detailed aspects will be discussed later on. Due to the 
common view, corruption is first of all a property of the relations between 
individuals irrespectively which definition of corruption one uses. Since an 
act of corruption requires that at least one of the participants in a corrupt 
relation has a mental state which tolerates or even assigns a positive value 
to (his personal view of) corruption we will focus mainly on the spread of 
this mental state change (from not accepting to accepting corrupt acts as 
an option for one's own activities). Therefore to discuss corruption as an 
epidemic process in the afore mentioned sense it is useful to assign a cor- 
ruption property to the individuals themselves. In the simplest case we just 
have a time dependent — 1 state variable uj (x, t) assigned to each indi- 
vidual, encoding whether the vertex is corrupt (1) or not (0) at time t (of 
course more refined scales for the degree of corruption are possible and will 
be discussed in a forthcoming paper). The underlying structure on which 
corruption spreads is a given finite graph G from a random graph space Q 
with fixed vertex set V = {1, , n}. Furthermore we consider in this arti- 
cle only stationary graphs with no changes in time on the underlying graph 
structure (the study of corruption on evolutionary graphs requires a paper 
in its own). The dynamics is specified by conditional transition probabilities 
(Pij (x)) which depend mainly on the states on B\ ix) — {y : d (x, y) < 1} and 
a meanfield term reflecting the influence of the total prevalence of corruption 
in the society. Here d(., .) is the usual graph metric on G G Q and d[x) is 
the degree of x. We define b t := uj (y,t) as the density of corruption at 

y&V 

time t. The standing assumptions on (p^ (x)) are the following: 

poi (x)=Pi{u(x,t + l) = l\w(x,t) = Q} = f x l^2w(y,t) \ +P(x)-b 2 t 

Pl0 (x) = Pt{lu (x,t + l) = 0\ lu (x, t) = 1} = 7 (x) ■ [1 - bt] (1) 

in other words the probability to become corrupt depends only on the local 
prevalence of corruption among the neighbors and the mean corruption in 
the society and individuals who became corrupt can cure from corruption 
with a rate proportional to the density of the noncorrupt individuals in the 
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society. In classical i.i.d. epidemics one would have the following as functional 
dependence for the local part of the conditional probabilities: / (k) — 1 — 
(1 — e) k which is for small e and k proportional to ek. For corruption the 
function / is more like in voter models, that is below a critical value A (x) 
of the number of corrupt individuals in B\ (x) the value of / is close to 
zero and above A (x) it is a number a (x) much larger then zero. Due to this 
property local clustering can force the epidemics to spread whereas in classical 
epidemic processes high clustering slows down the spread of an infection due 
to reinfection of the already infected. We want to illustrate this by two simple 
examples. 

Example 1 The simplest, almost trivial example is the Z 1 lattice with addi- 
tional edges to the next-nearest neighbors. Setting f (1) = /i > and f (i) — 1 
for i > 1 it is easy to see that there is a nonzero probability for infecting all 
vertices starting with one infected individual at time 0. 

Example 2 The infection function f will be the same as in example 1. We 
start with a regular tree of degree 3. Replacing each vertex by a triangle and 
gluing the triangles along the former edges of the regular tree gives a regular 
graph of degree 4 where the triangle corners act now as the new vertices. In 
each neighbor pair of triangles (A,B) (that are the triangles which have a 
common vertex) we form an edge randomly between the set of vertices lying 
in A\B and B \ A (see Fig. 1). Once a triangle is infected the corruption 
jumps to all the three neighbor triangles due to the extra random edge present 
between each neighbor pairs of triangles. Hence again we have a nonzero 
probability that the whole graph becomes infected. 

In the above examples we have used a very simple and somehow extreme 
form of the infection function /. In the following we will investigate the 
situation for two canonical subclasses of infection functions. We say that 
/ is a vertex independent, fixed threshold infection function if there is a A 
such that / (i) = e for < % < A and / (i) = a 3> e for % > A. For the 
second class of functions we assume the threshold to be degree dependent. 
Namely we call f x a vertex dependent, relative threshold function if for some 
5 G (0, 1) we have f x (?) = e for < i < 5d (x) and / (i, x) = a ^> e for 
i > 5d (x) . Furthermore we say that / is a voter- type infection function if 
/ (i, x) = e for i < (x)\ and / (i, x) = a for i > (x)\ . In this paper 
we will mainly investigate the spread of corruption for the fixed threshold 
case. 
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Figure 2: A highly clustered network with underlying tree structure. 



process name 


characteristic 


typical value 


a - process 


the local transmission process for 
# of corrupt neighbors > A 


a >> e,/?,7 


(3 - process 


the mean field transmission process due to the 
total prevalence or perception of corruption 


e < (3 < 7 


7 - process 


the corruption recover /elimination process 
due to the fight of the society against corruption 


j3 < 7 < a 


e - process 


the classical local epidemic process for 
# of corrupt neighbors < A 


£ « a, (3, 7 



Table 1: the different processes for the corruption dynamics 



To distinguish between the different ways in which an individual can be- 
come corrupt we will speak about the a, (3, e or 7— process. For convenience 
of the reader we give in tabular 1 a summary of the different processes. 

Note that in contrast to standard voter models we do not have the possi- 
bility of a locally induced backflip from the corrupt state to the noncorrupt. 
A kind of quenched disorder could easily be introduced by randomizing the 
relevant parameters individually but this will be the subject of a forthcom- 
ing paper. Generalizations of classical epidemic dynamics to processes with 
a local threshold have recently also been studied in the context of models of 
contagion (see [§| and references therein) but not yet been mixed with global 
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mean field processes. 



4 The structure of social networks 

In the last 10 years there has been an enormous progress in understanding 
the fine structure of social networks. This is mostly due to the availability of 
large data sets for some special social networks like E-mail correspondence, 
coauthorship network in scientific publications and movie actor networks to 
name just a few prominent examples. All these networks of social relations 
share three remarkable properties of the associated graph which are: 1) the 
diameter scales at most logarithmically in size 2) the graphs have a very 
high, asymptotically non-vanishing clustering coefficient- in other words the 
graphs are locally far from being tree-like 3) the degree distribution follows a 
power law (scale-free graphs). Properties 2 and 3 have striking consequences 
for the spread of corruption as will be discussed later on. 

There exists meanwhile a large collection of algorithms to generate com- 
plex networks with the above mentioned properties. 

A widely used quantity to measure the local clustering is the triangle 
number A(x) := ^{triangles containing x} and it's averaged value A. A 
natural generalization is the k— clique number Ck {x) defined as the number 
of complete graphs of order k containing x. In social network graphs A (x) is 
usually proportional to d (x) and A becomes independent of the population 
size for large N and stays bounded away from zero. Another very remarkable 
property of real networks is the power-law distribution for the degree. By an 
asymptotic power-law distribution for a discrete random variable d we denote 
every functional behavior of the form Pr {d = k} = k~ x+0k ^ with exponent 
A > 1. Most real networks have exponents between 2 and 4 (see PQ for an 
excellent overview). Classical epidemic processes on such graphs have been 
studied by many authors, and perhaps the most astonishing result in this 
context is the absence of an epidemic threshold in case the exponent is below 
3 (C3)- This phenomenon is related to the existence of a massive center of 
size independent diameter induced by the high number of hubs (vertices with 
an exceptional large degree). Hubs play also a significant role for a - process 
as will be explained in the next section. 

One of the main differences between corruption epidemics and classi- 
cal epidemics is the different effect of clustering on the epidemic threshold 
and the total number of infected individuals. In the classical situation any 
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epidemics will be slowed down by the presence of local cycles due to the 
high probability of reinfection. In corruption epidemics local clustering may 
speed up the propagation of corruption due to the nonlinear dependence of 
the infection probability on the number of infected neighbors as was already 
demonstrated in example 1 in the previous section. In the next section we 
will give two further examples where the strength of this effect can explicitly 
be computed. 

5 Phase transitions for the a - process 

In this section we want to look at some threshold properties associated to the 
a— process. We are still far from a good understanding of the quantitative 
picture of this kind of processes for a given type of graph, which is mainly a 
consequence of our lack of knowledge how to handle graphs with high local 
clustering in a mathematical satisfactory way. In this section we want to state 
just some general observations and numerical results concerning the spread of 
threshold-like dynamics. Furthermore we will analyze two examples of tree- 
like graphs which might serve as an illustration. A more careful mathematical 
analysis of a— processes requires a paper in its own. 

One of the remarkable differences between a classical epidemic process and 
a process based on local threshold dynamics is the dependence on the initial 
number of "infected" vertices in the latter case. Classical epidemics does not 
know such things- either an epidemic process is overcritical (reproduction 
number Rq > 1) and a single initial infected vertex infects with with positive 
probability a positive fraction of the whole population, or the process is below 
criticality (Rq < 1) and all infected will die out respectively become healthy. 
In corruption epidemics both parts - the mean field process as well the local 
a— process - can have phase transitions with respect to the initial number 
of corrupt vertices. That means, there is critical initial density of corrupt 
vertices b$ such that for initial densities below 6q the number of infected stays 
as it is or goes down to zero. Above 6q the entire population becomes corrupt 
with high probability. As an illustration we give in Fig. El the dependence of 
&o on the edge density |ona classical random graph space Q (N, M) with 
N vertices and M edges. 

Although in this paper we mainly concentrate on the case of absolute 
threshold values A we give for comparison in Fig^the edge density depen- 
dence result for a relative, degree dependent threshold A (x) = [0.8 • d(x)~\. 
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5000 10000 15000 20000 25000 30000 

M = #Edges 



Figure 3: numerical estimation of the critical initial density 6q as a function 
of the number of edges M for the following parameter values: A = 5; a = 
0.35;/? = 0.08; 7 = 0.04; e = 0.005iV = 4000 .Vertical segments are errorbars 
over 20 runs. 



There is still a critical density but its value increases with the edge density 
since the mean threshold increases now proportional to the mean degree. As 
already mentioned in section 3 one expects that the presence of clustering 
(respectively many triangles) decreases the critical density by since the a- 
process can propagate more easily. In Fig. El the effect of the increase of the 
triangle number is clearly to see. Here we used a modified Q (N, M) random 
graph space where randomly triangles are added (keeping the total number 
of edges constant). The threshold value A was chosen to be 2 since for higher 
A one has to add higher order complete subgraphs instead of triangles. 

The next figure (Fig. EJ) shows the dependence of the critical density on 
A. The two curves represent the threshold values for an end-prevalence of 10 
respectively 90 percent. Since the mean degree in this simulation is about 6.5 
one has a vanishing contribution of the a- process above A = 8. The critical 
threshold 6q stays than essentially at a value given by the mean field process 
(see next section for details). To get an impression of the contribution of the 
different kind of processes (local a and s, global j3 and 7 - for details see next 
section) to the end-prevalence we give in Fig. Qthe accumulated number of 
state changes caused by each of the subprocesses till saturation. For small 
values of A the a- process dominates all others. 
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Figure 4: critical initial density 6q as a function of the number of edges M 
for relative delta and parameter values: A (x) = [0.8 • d (x)] ; a = 0.35; (3 = 



0.08; 7 
runs. 
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Figure 5: Critical density b$ versus triangle density for the parameter values: 
A = 2; a = 0.3; (3 = 0.08; 7 = 0.04; e = 0.005; N = 1000; M = 2000. 
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Figure 6: Lower and upper bounds for the critical density b$ as a function of 
A for the following parameter values: N = 1500; M = 5000; a = 0.35; (3 = 
0.08; 7 = 0.04; e = 0.005 




Figure 7: Total number of state changes splitted according to the different 
subprocesses as a function of A for the same parameter values as in FiglHl 
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We turn now to a more theoretical consideration, namely which type of 
vertices (type in the sense of degree and local clustering) are especially well 
suited for the propagation of corruption via the a— process. Assume we have 
given a random scale free graph space Q with N vertices. We further as- 
sume that there are two types of edges (according to the way they where 
generated) the independent ones, generated at random with just preferences 
to the degree (like the preferential attachment rule by Albert&Barabasi or 
the "Cameo-Principle" in 0]) and local ones which are relevant for the cre- 
ation of triangles. Let further the (asymptotic) degree distribution given 
by (p (k) := Pr {d (x) = k} = B ^ with k min > k and B (k ) the normal- 
ization constant. The independent edges are generated with probability p 
and each individual generates k > 2 edges by himself. From jlj one knows 
that the triangle number A (x) is proportional to d (x) with a proportion- 
ality constant C (p ). A basic quantity in highly clustered networks is the 
probability q(x) that two random chosen elements from Ni (x \ d (x) > 1) 
have a common edge. Since A (x) ~ crd (x) one obtains for the conditional 
probability q k (x) := Pr {z ~ y | z, y e Ni (x) A d (x) = k} ~ jgj- ~ 22c. As- 
suming that the generation of triangles is a sufficiently independent process 
one obtains for the conditional I— clique number E (Cj (x) \ d(x) = k) = 

(i-l)(i-2) (i l)(l j ~ 2 ) 

G-i) (^") ~ ~ — ' cons t- Here C\ (x) is the number of l- 

cliques (complete graphs of order I) containing x. For I > 4 the power in the 
A;— dependence gets negative and hence the high degree vertices contribute 
almost nothing to the Clique-clustering. Of course all this consideration rely 
on the assumption of some kind of independence in the triangle-formation 
process. In any case this results indicate that highly clustered medium degree 
vertices are especially well suited for the spread of corruption. A similar kind 
of analysis can be carried out for random graph models which have an intrin- 
sic high probability to generate local cliques e.g. intersection graphs (for an 
introduction to random intersection graphs and comparison with Erdos-Renyi 
random graphs see ^2] and JI]). The above arguments seem to support the 
conjecture that in corruption epidemics the vertices from the tail of the de- 
gree distribution play a less dominant role. This is indeed true in the case of 
a relative, degree- dependent threshold where hubs are much more difficult to 
infect than medium or low degree vertices. For absolute thresholds in the a- 
process the situation is more complex since for scale free degree distributions 
with small exponents (A < 3) there are other mechanisms than local clus- 
tering which can cause a radical dropdown of the critical initial density. In 
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- P_10% end prevalence - undercritical 
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Lambda, exponent of ScaleFree tail 



Figure 8: b$ as a function of the exponet A in a scale-free degree distribution 
with parameters: N = 20000; M = 50000; A = 5; a = 0.35; /3 = 0.08; 7 = 
0.04; e = 



Fig |H] and Fig|U]we give numerical results for the relation between the critical 
density 6q and the exponent A keeping the edge density fixed. There is a 
clear phase transition around A ~ 2.3 for A = 5 and A ~ 2.9 for A = 2. The 
explanation of this observation is closely related to a structural phase tran- 
sition in scale free random graphs at A = 3 - namely that for most vertices 
x an asymptotically positive fraction of all vertices has bounded distance to 
x. To link this property with the a - process one has to look more closely on 
the degree-degree correlation in scale-free graphs. Depending on the choice 
of the model one can have very different correlations like: 

k + k' 

Pr {x ~ y | d (x) — k A d (y) — k'} ~ const ■ — — — or (2) 

k ■ k' 

Pr {x ~ y | d {x) = k A d (y) = k'} ~ const ■ — (3) 

. Formula El holds for instance for the Cameo - model ([I]) whereas formula 
El is valid for scale-free graphs generated via the Molloy&Reed algorithm 
(the later one represents the random graph space containing all graphs with 
a given scale-free degree distribution equipped with the uniform measure 
and was used for the simulations in Fig|H] and EJ) • Evolutionary graphs like 
the Albert&Barabasi model have usually asymmetric and more complicated 
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Figure 9: upper and lower bounds for 6q as a function of the exponet A in a 
scale- free degree distribution with parameters: N = 20000; M = 30000; A = 
2; a = 0.35; (3 = 0.08; 7 = 0.04; e = 



correlations. Since a detailed analysis of the a - process for scale-free graphs 
is beyond the scope of this paper we just give a heuristic outline why in 
graphs with a correlation as in formula E3 the threshold density 6q tends to 
zero as iV — > 00 for exponents A < 3. For fixed bo > N^~ u and v > (note 
that the typical maximal degree is about N*) it is obvious that vertices x 
with d(x) > k >> ^ get almost surely infected (as iV — > 00) via the a - 
process as soon as 7 < a. Let Ak be the set of such vertices. One the other 
side it follows from El that a vertex y with d(y) = k < k is linked to the set 
v4fc with probability 



^max~A r A / , t t / \ Const- N , 

1 TT I 1 const ' k ' k \ (a\ 
|| 1 (4) 



k'>k 



~ 1 - e fc '^ fc « ( } ~ 1 - e fc o (5) 

. Since is close to 1 for k > k,Q~ 2 one has an almost sure multiple linkage 
of vertices y with d(y) > k^ 2 < k to the set A ko . These vertices get 
now again infected via the a - process. By iterating this procedure one 
may arrive at an positive N- independent infection density b t » bo such 
that the (3 - process is overcritical and finally the whole vertex set becomes 
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corrupt. The mechanism described requires iV to be large and therefore 

we conjecture that the difference to the numerical results depicted in FigJHl 

(phase transition at A < 2.3 instead of 3) is due to finite size effects. In 

the case of A = 2 (FigEJ) the finite size effects are smaller and the phase 

transition is closer to 3. A similar kind of arguments shows, that the expected 

path- length is finite for A < 3. Namely since the expected number Si of 

vertices at distance I from a vertex x with degree ko is approximately given 
i 

l N ~* k k k k k k (i-l)(3-A) 

by (const) ■ Yl ~ K 7r~ [x 1 ' 1 ~ const ■ k ■ N t for A < 3 (note 
k u ...M {kl > ■-•W-J 

that this expression is only valid for I s.t. ^~ 1 ^ 3 ~^ . log ko < 1). The essential 
diameter diam e (a large fraction if the whole vertex set is within a ball of 
diameter diam e ) is then given by the smallest I such that ^~ 1 ^ 3 ^ A - > > 1 (for 
a more extensive discussion of the notion of essential diameter see For 
A = 2 one obtains therefore diam e = 3. For A > 3 the essential diameter is no 
longer bounded but growths logarithmically in N. It is interesting that the 
jump in the critical density at 2.3 in Fig|H] coincides with a jump in diameter 
from 4 to 5. A small essential diameter can have fatal consequences for 
corruption epidemics since most vertices are closely linked to hubs and, as was 
outlined above, hubs are with high probability corrupt. A precise estimation 
of the dependence of 6q from N, M and A requires a careful discussion of the 
involved constants. For scale-free graphs with additive degree correlation 
like Cameo-graphs one still has a bounded essential diameter for exponents 
less than 3. But the first argument about chains of almost sure linkages 
from high degree to low degree vertex sets can not be adopted. One expects 
therefore a higher value of the critical density b^. In Fig^Jand Fig. fTTI we 
give numerical results for a scale-free graph with additive degree-correlation 
generated via a modified Molloy&Reed algorithm 1 

To compare with the multiplicative case we have chosen the same param- 
eters and degree-distribution as for FigEl There is a clear increase of 6q to 

1 In the usual Molloy&Reed algorithm one generates d (x) virtual vertices for each vertex 
x and makes than a random matching between the virtual vertices. Two vertices x and y 
are connected by an edge if there is an edge between two corresponding virtual vertices. 
To generate an additive degree correlation we mark M virtual vertices as red such that 
each vertex x has at least one red and at most C [4$] red associated virtual vertices (if 
there are note too many vertices with very small degree, the constant C can be choosen 
as ). Than the marked red virtual vertices are randomly matched with the unmarked 
ones. 
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Figure 10: upper and lower bounds for 6q as a function of the exponet A 
in a graph with scale-free degree distribution, additive degree correlation 
and parameters: N = 20000; M = 30000; A = 2; a = 0.35; (3 = 0.08; 7 = 
0.04; e = 




Figure 11: b$ as a function of the exponet A in a scale-free degree distribution 
with parameters: N = 20000; M = 50000; A = 5; a = 0.35; (3 = 0.08; 7 = 
0.04; e = 
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observe but, although unlikely, it remains open wether there is a vanishing 
threshold in the limit iV — > oo. For intermediate couplings we still expect 
&o (N) — > as N diverges for A < A c G (2, 3) where A c depends on thew 
concrete model. It is remarkable that low A and a tendency to multiplica- 
tive correlation is mainly expected to hold in societies with strong hierarchi- 
cal structures of social dependencies e.g. dictatorships (see jH] for details), 
whereas democracies are characterized by less strong degree correlation. 

Finally we will discuss two examples of graph structures where the critical 
infection density can be explicitly be computed. The first one is a regular 
infinite tree of degree 4 where of course no triangles are present (see Fig. [T2")) . 




Figure 12: Segment of a regular infinite tree of order 4 

. The second structure is a regular infinite graph of again of degree 4 with 
positive local cluster coefficient (A (x) = 2) and a global tree-like structure 
(see FigHU). 

In both cases an exact computation of the critical infection density is 
possible. We give a short outline for the case of threshold value A = 2 
and a = 1 (the case a < 1 requires more lengthy computations but can 
be done in a similar fashion) and start with the case of the regular 4 tree. 
An initial configuration is given by marking each vertex with probability p 
as noncorrupt (black) and with probability 1 — p as corrupt (white). We 
ask for the critical probability p c such that for p < p c almost surely the 
entire tree becomes white (corrupt) and for p > p c there remains an infinite 
cluster of noncorrupt (black) vertices with probability one. Note that no 
finite cluster of black vertices -that is a finite black subgraph surrounded by 
white vertices- can survive so there are either infinite black clusters or none. 
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but got eventually corrupt 
(local threshold =2) 



Figure 13: Segment of an infinite generalized tree (degree 4 and branching 
number 3) 



We call an invariant infinite black cluster immune. Since A = 2 any vertex 
in an immune cluster must have at least three black neighbors from that 
cluster. Denote by T R (3) the rooted tree with outdegree 3 (fixing a root 
gives a canonical direction to the edges of the tree so it makes sense to speak 
about the outdegree of a vertex). Every vertex has degree 4 except the root 
which has degree 3. Let x be the p— dependent probability that the root is 
contained in an immune cluster (as a subgraph of Tr(3)) conditioned that 
the root vertex is initially black. By arguments from the general theory of 
branching processes x equals the largest solution of the following recursion 
equation 




a) b) c) 

Figure El displays the different situations which enter the above equation. 
The solutions are 7^3 (\p 2 ± \ \/—8p 3 + 9p A j and 0. Since — 8p 3 + 9p 4 > is 
needed to have a positive nonzero solution we get for the critical probability 
p c = § ~ 0.888 89. 

In a similar fashion one can derive a recursion equation for the generalized 
tree case. For that let Tr (2, 1) be the rooted generalized tree shown in Figll5l 
To every vertex is attached an outgoing triangle, hence the degree of a vertex 
is 4 except the root which has degree 2. To settle the question about p c for the 
original generalized tree it is enough to analyze the corresponding problem 



19 




Figure 14: Different configurations in the neighborhood of the root vertex.. 
Black denotes vertices in an immune cluster and grey an initial black vertex 
which became white. 




Figure 15: The local picture around the root vertex in Tr (2, 1) 
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for Tr (2, 1). Again let x be the probability that the root vertex is in an 
immune cluster conditioned that the root is initially black. One gets the 
following recursion equation 

x = p 2 x 2 + 2p i x 2 (1 - x) + 2p (1 - p) p 2 x 2 (7) 



b) c) a) 

(see FigEJ). The solutions are ^ (±p 2 + p 3 ± |a/-7p 4 + 4p 5 + 4p 6 j and 0. 

Again since — 7p 4 + 4p 5 + 4p 6 > is needed to get a positive nonzero solution 
we get for the critical probability p c = \p2 — \ — 0.914 21. That means the 
presence of clustering in this example lowers the critical initial density needed 
to infect the whole graph by almost a factor of |. 

The study of the regular 4— tree generalizes easily to the case of regular 
n + 1— trees (n > 2). The recursion equation in this case is 

x = p n x n + np n x n ~ 1 (1 - x) + np n ~ x (1 - p) x^ 1 . (8) 

A straightforward but lengthy computation gives for the critical probability 

/ \2n— 3 

p c = [n ~ L) - ■ n > 2 . (9) 

In the special case of a 3— tree (n = 2) one obtains p c — |. For completeness 
we give without proof the formula for the computation of the critical proba- 
bility in case of a rooted random tree with arbitrary outdegree distribution. 
Let g (z) = diZ 1 be the generating function for the outdegree; that is dj 

i>2 

is the probability that a random chosen vertex has outdegree i (and hence 
total degree i + The critical probability p c is given by the smallest p such 
that the equation 

- = (l-z)g'(z) + g(z) (10) 

has a positive real solution. 

A careful reader may have noticed that there is a big structural difference 
between the generalized tree in example 2 of section 3 and the generalized 
tree just discussed above. Namely the graph of the first example has the 
property, that any two vertices can be linked by a chain of triangle where 
neighbor triangles always have a common edge. The graphs in the examples 
of this section do not have this property since neighboring triangles have only 
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a common vertex. For threshold values A > 2 one has to consider chains of 
A + 1 cliques. We say that a graph is well k -linked if any pair of vertices can 
be linked by a chain of complete graphs of order k such that all neighboring 
k— cliques have a k — 1- clique in common. For well k- linked graphs the 
critical density b° c is zero (a finite number of initially infected vertices can 
already infected a positive fraction of the vertex set) for a— processes with 
A < k whereas for graphs which are not well linked one needs a positive 
critical density. 

The above study on trees or generalized trees is insofar important as in 
most random graph models used for complex networks one has as a tree or 
generalized tree as the typical local structure around a random chosen vertex. 
Furthermore the dependence of the corruption dynamics on graph properties 
like edge density or degree distribution is in large parts of the parameter 
space entirely caused by the a— process. 

6 State and individual (mean field /3— and 
7— process) 

In this section we want to have a closer look at the mean field dependence 
of the corruption process. To gain some insight in the possible type of be- 
havior we start with some simple assumptions which will be refined later 
on. Again we will argue in a discrete time model but the transition to 
continuos time makes no problem and gives the same results. Let b t the 
density of corrupt people at time t. We assume that the affinity for an indi- 
vidual to change its behavior from noncorrupt to corrupt increases propor- 
tional to the corruption prevalence. Furthermore to become really corruptly 
minded an individual has to overcome some fear which we put proportional 
to (1 — bt). Formally this reads as Pr {uj (x, t + 1) = 1 | uo (x, t) = 0} = (3bf 
with (5 G [0, 1]. Corrupt individuals can recover due to state and police ef- 
fects (uncovering, fear etc.). Again it seems reasonable to assume that the 
probability to recover is proportional to 1 — b t since only the noncorrupt 
part of a society is willing to fight corruption. Formally we will assume that 
Pr {uj (x, t + 1) = I uj (x, t) = 1} = 7 (1 - bt) with 7 G [0, 1]. This gives 

b t+1 = (i-b t )pb 2 t +b t -b a (i-bt) 

= 6 t (l-7) + 6?(7 + 0)-6?0 (11) 
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with the two obvious fixed points and 1. For f3 ^ there is a third interme- 
diate fixed point b* := An interesting phenomena happens for parameter 
pairs 7) s.t. 7 < (3 since under this conditions both fixed points at 
and 1 are locally stable. Hence there are two basins of attraction- one for 
and one for 1— with b* as the boundary point. In other words, if the initial 
percentage of corruption is less b* corruption stays under control whereas for 
an initial value larger b* things run out of control and a corruption collapse 
takes place. Of course this mean field part of the model is still very simplistic 
and one should not expect any quantitative fit with empirical data. But the 
qualitative statement seems to be quite stable with respect to modifications. 
For instance there are good reasons to believe that neither the mean field 
infection nor the mean field recover process are linear in b t . 

We want to end this section with a small modification of the mean field 
"Ansatz" where we include social weights. This is a natural and meanwhile 
very common approach in network dynamics and can easily be adopted to 
the corruption model. In the above argumentation on the attraction of be- 
coming corrupt it is plausible to assume that corrupt individuals with high 
social influence have a stronger influence on the mean field probability to 
get corrupt than individuals with low social importance. A similar argument 
holds for the recover probability. As a simple measure for social strength we 
use the degree of the vertices since high degree vertices are more likely to 
play a dominant social role than low degree vertices. Formally we introduce 
the weighted density by at time t as 

by := -* T (12) 

E(4) 2 

k 

where dk is the number of vertices with degree k and 1^ the number of 
corrupt (state 1) vertices with degree k at time t. The mean field equation 
for group k is now given by 

48 = P (Kf (dk ~ ll k) ) + - 7 (1 - K) J< fc) (13) 
. Multiplying the last equation by ^ and summing over k gives 

k 

by +1 = (1 - by) (3 (by) 2 + by - b- 1 (1 - by) (14) 
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which is the same as equation ([lip. Therefore the introduction of social 
weights does not add anything new to the dynamical picture. There is of 
course a difference in the interpretation since a small real initial prevalence of 
corruption can give rise to a high initial value of h™ as soon as the corruption 
is concentrated at the high degree vertices. Here also a difference between 
scale free networks and classical random networks is seen since in the scale 
free case high degree vertices (hubs) are much more frequent than in the 
classical case. 



7 Interaction between the mean field process 
and the local threshold dynamics (/?+7 ver- 
sus a) 

In this paragraph we will investigate some aspects of the interplay between 
the mean field process described in the previous section and the local, thresh- 
old dependent, corruption propagation. For a > 7 there is a core infected 
component generated via the a— process. To gain some insight how such 
a core infected part of the population changes the mean field dynamics we 
will assume that a certain fraction, say a, of the population is permanently 
infected and resistant to the 7— deletion process. Denoting by q t = b t — a 
the density in the noncore part of the population (the normalization here is 
still with respect to the total population size) we get the following mean field 
dynamics: 

q t+ i = (l-a- q t )(3 (q t + a) 2 + q t - q t j (1 - a - q t ) 
= a 2 (3 - a 3 (3 + q t (2a[3 - 7 + 07 - 3a 2 j3 + l) + 
+ g 2 (/3 + 7 -3a/?)-/?g? (15) 

. Since the state where all individuals are infected is stationary we get the 
following set of fixed points: 



+ 1, i (±7 - off - i + 7 2 ) , \ (57 " a/S + 



■4a/?7 + 7 ; 



. For — Aa/3'j + 7 2 < there are no real fixed points except q* = —a + 1 which 
becomes globally stable under this condition. Since we have a polynomial 

of degree 3 we get ^ ^7 — a(3 + \ \f—Aa(3 , ~) + 7 2 J < 1 as the condition for 
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the fixed point at 1 — a to be locally stable. Furthermore in this case also 

the fixed point at ^ ^7 — a(3 — —Aa(3^j + 7 2 j becomes a local attractor. 

This is for instance the case when a becomes very small and (3 > 7 - be- 
ing back essentially in the situation of the previous section. In case when 

^ f|7 — a(3 + I a/— 4a/?7 + 7 2 j > 1 it is easy to show that the fixed point at 

^ ^57 — a/3 — 4a/?7 + 7 2 j becomes a global attractor (to see this just 

note that the derivative at qt — is always positive for the relevant param- 
eter intervals). The above considerations show that the possible dynamical 
evolution scenarios are the same for a = and a ^ 0. But there is a very 
strong influence of a on the parameter regimes of (3 and 7 for which one has 
a corruption collapse. Whereas in case a = one is always in the basin of 
attraction of zero for bo sufficiently small and 7 7^ (in other words 6 = 1 
is never a global attractor) one can now have the phenomenon that only the 
complete saturation with corruption is stable ( q = 1 — a). As an example 
lets look at the case where (3 = 27. For a = there is a fixed point at 
b* = 0.5 and hence for an initial infection density 60 < 0.5 the pure mean 
field dynamics converges to zero. In the case a ^ one has for a > 1/8 
only the stable fixed point b* = a + q* = 1. At a — 1/8 there is a phase 
transition since a new indifferent (slope 1) fixed point at b* = 1/4 emerges. 
For a < 1/8 this fixed point bifurcates into two fixed points where the first 
one at b* = | — — 8a becomes locally stable with a basin of attraction 
given by b < \ + \y/l - 8a. 

We close this section by presenting a numerical result showing the dif- 
ferent contributions to the overall infection (end-prevalence) of the local and 
mean field processes as a function of the edge density in the random graph 
space Q (N,M). FigHH gives the accumulated number of state changes (di- 
vided by N) caused by the a, (3, 7 and e- process at initial density values 
slightly above the critical one. Up to an edge density of 2 (corresponding 
to a mean degree of 4) the f3— process gives the major contribution to the 
end prevalence in the overcritical situation. Parallel to the increase in the 
edge density increases the contribution of the a- and s- process (in the in- 
termediate phase of density between 2 and 3 dominated by the a- process) 
till a sharp peak at edge density 4.5 where the e— process outperforms all 
the others (at the same time the critical initial corruption density 6g drops 
down and becomes almost zero). The peak is easy to understand since for 
the chosen parameters we have at an edge density of 4 an equality between 
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Figure 16: The total number of state changes splitted according to the a, (3, 7 
and e- process for the following parameter values in a Q (N, M) graph: N = 
4000; a = 0.35; f3 = 0.08; 7 = 0.04; e = 0.005; A = 5. 



the recover rate 7 and the expected number of new corruptions caused by 
a single corrupt vertex via the e— process (which is K(d(x)) ■ e). In terms 
of classical epidemic processes this corresponds to the case of reproduction 
number Rq = 1. Above this value single initial corrupt vertex is already 
enough to cause in conjunction with the mean field process a total infection 
of the network. 

8 Single run simulation results 

In this section we want to present some simulation results of the corruption 
process taking place on some medium size complex networks. Small graph 
sizes are interesting as they are typical for communities in highly social struc- 
tured populations. As a simple to generate random graph space with high 
clustering and power law degree distribution we have chosen so-called inter- 
section graphs. Intersection graphs can easily be defined as follows. First 
one forms random sets from a finite base set of N elements (random means 
in this context that the set elements are chosen uniform i.i.d. from the base 
set). These sets constitute the vertices of a random graph. Edges will be 
defined via the set intersection property, namely there is an edge between i 
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FP2 Cl987-199:n 


FP3 (1990-1994) 


# vertices 


4879 


7710 


# edges 


57633 


93852 


mean degree 


23.624 


24.346 


maximal degree 


844 


1014 


# vertices wiht degree > 5 


3865 


6051 


size of largest component 


4775 


7356 


mean 7^ triangles per vertex 


256.89 


418.09 


exponent of degree distribution 


2.1 


2.4 



Table 2: Properties of the real networks FP2 and FP3 



and j if the associated sets Ai and Aj have nonempty intersection. The size 
(cardinality) \A\ of a set A is itself a random variable drawn i.i.d. from a 
pre-given probability distribution <p(k). To get interesting graph spaces one 
furthermore requires N < ^2 \ Ai\ < const ■ N. For theoretical results about 
the structure of random intersection graphs see [7] ^2] El • ^ ^ s worth noting 
that intersection graphs have a high clustering by definition (if an element is 
contained in say k sets simultaneously this k sets form a complete subgraph). 
Most simulations were done for the case when (p is an asymptotic power law 
distribution with exponent 3 or when ip is singular (all sets have the same 
size). Random intersection graphs have a multiplicative degree correlation 
and therefore the critical threshold should be very low for exponents less 
than 3 be the arguments from section 5. Above that value the form of the 
degree distribution has only little influence on the corruption propagation. 

Besides random intersection graphs generated according to some degree 
specifications we used also a collection of real collaboration graphs. These 
graphs come from a database about research and development projects funded 
by the European Community (FP2-3). It's vertices are organizations involved 
in European research projects. Two organizations are linked if they have a 
joint project (see table II for the main graph characteristics) . In total the data 
base contains about 8000 projects and 13000 participating organizations. In 
essence the network shows all the main characteristics that are known from 
other complex network structures like scale free degree distribution (with 
exponent between 2 and 3), small diameter and high clustering and vertex 
correlation. The initial fraction of infected individuals was either distributed 
at random over the vertex set or clumped together in a sufficiently large ball 
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Figure 17: Low, semistable prevalence in a real collaboration network (FP2), 
A=30 e=0, a=0.99 (3=0.09 7=0.545 6 =0.1 N=4879 



with a random chosen vertex as center. 

In the following we want to give a small sample of simulations on the just 
mentioned graphs and try to discuss its main features. FigEI displays the 
prevalence of corruption on the real network FP2. The absolute threshold 
value A = 30 is very high and does not allow for a big outbreak of corruption. 
But there is a metastable small community of individuals, highly linked and 
almost resistant to the 7— process. It took more then 800 complete updates 
till this structure broke down. 

The next figure^] presents a similar situation on an almost twice as large 
real graph (FP3). In contrast to the previous case we have a much smaller 
a— value and an only slightly reduced threshold A. The network 

FP3 is extremely high clustered (mean degree = 48.6, mean triangle number 
= 418 and a total of 7710 nodes and 187704 edges) and stays metastable 
with a very small corruption cluster for about 200 updates till it jumps 
by a factor 10 to another metastable state. FigE3 gives a more detailed 
view of the accumulated contributions by the different processes for a time 
interval around the jump in prevalence. In the initial phase the e— process 
was dominating the f3— process and vice versa in the second phase. The 
next pictures show a situation where after an initial phase of slow growth 
a corruption collapse happened. It seems that the absolute threshold value 
A = 20 is well below the critical value where the system can still stabilize. 
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Figure 18: switch from a very low prevalence semi stable state to a low 
prevalence state (FP3), A=25, £-=0.001, a=0.2, £=0.04, 7=0.03, 6 =0.005, 
N=7710 
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Figure 19: The contribution of the different infection paths for a segment of 
the prevalence curve in Fig. 
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Figure 20: Slow increase of prevalence till collapse (FP3),A=20 £=0.001 
a=0.2 0=0.04 7=0.03 6 =0.005 
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Figure 21: Accumulated infection processes for the situation in Figl^Ul 
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Figure 22: details for the initial phase from fig Ml 
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Figure 23: A=8 £=0.005 a=0.1 (3=0.09 7=0.045 b =0.1 N=952 
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Figure 26: Process splitting for the situation from Fig|2"H 



It is surprising that the system semi-stabilizes after an initial rapid increase 
in the prevalence for a rather long time (Fig|20p. To a certain extend the 
results can be explained along the argumentation in section 7. In Fig]2*2*lthe 
accumulated infection processes for the initial phase are shown. Here the 
e- process, although undercritical, causes a redistribution of infection till a 
clustered configuration is reached such that the a- process can start. Than 
the systems stays in almost complete balance till the j3— process (which is 
slow in all our examples) wins (Figl21|). Note the difference to Fig lT^l where 
the (3— process never really contributes to the infection. Finally we show 
two simulations for a sample of a random set graph model with about 1000 
vertices (Fig|22]and Fig|24j). Although both prevalence curves look similar 
there is a clear difference in the process fine-structure (Fig|2H]and Fig|26|). 
In the first instance the e— and (3— process are causing the collapse whereas 
in the second case the a— process in conjunction with the e— process is the 
main booster. 

The few examples of single simulation runs given in this section show 
already, that there are many different routes to obtain high prevalence in 
corruption typically interrupted by long phases of metastability. Similar to 
other complex systems with hidden phase transitions (e.g. the climate) there 
can be an unnoticed small accumulation of infection till a critical density- a 
point of no return- of corruption is reached from which on an almost com- 
plete saturation of the society (or a corresponding subsystem) by corruption 
becomes the normality. 
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9 Epidemic control 



One of the basic question in classical epidemics as well as in corruption 
dynamics is: what can be done to slow down the "infection" propagation or 
prevalence. Knowing the different phase transitions and their dependency 
on structure properties and social parameters is of great help in designing 
proper prevention scenarios. In the following we will try to relate some of 
the findings from our model to what is considered by practicians as useful 
in corruption reduction. First we would like to emphasize again that the 
present model deals in a rather abstract way with the propagation of mental 
willingness to be corrupt and not so much with realized corruption which 
always requires a specific environment and additional structural assumptions. 
Hence concerning corruption control, we only will be able to support certain 
prevention scenarios in the sense, that they go into the right direction and 
that there effect is strong or weak but without being able to make quantitative 
statements. 

The model presented in this paper contains, besides structural parameters 
for the underlying network, 5 relevant parameters: a- characterizing the 
strength of the local threshold process, (3- characterizing the strength of the 
mean field attraction on becoming corrupt, 7- the strength of the "society 
strikes back" term, e- the strength of the classical epidemic process (assumed 
to be very small) and A- the height of the local threshold. Three of the 
parameters- a, f3 and e- are positively correlated to the spread of corruption 
whereas 2 parameters- A and 7- are negatively correlated. As is well known 
from classical epidemic control for infectious diseases it is very hard if not 
impossible to change basic social parameters in a short time. This can only 
be achieved in a long running educational process. Therefore not much can 
be done in avoiding high clustering in certain relevant areas of the society in 
order to prevent the emergence of highly connected corruption nets. 

As the name already indicates, Transparency International favours as an 
effective tool to decrease corruption especially the increase of transparency 
in all forms of administrative decision making as well as transparency in fi- 
nancial affairs of socially exposed persons, institutions and companies. The 
effect of an increase of transparency translates into our model as an increase 
of the value of A and a decrease of the values of f3, a and e. Strengthening 
of justice, police and similar instruments to fight and uncover corruption has 
again the effect of lowering (3 (via increase of fear) but may also increase the 
value of 7 (uncovering rate). Since an increase of 7 above the value of f3 and 
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a would perhaps require a total police state, it is illusionary to overcome cor- 
ruption just by means of law, justice and police. Besides necessary long term 
educational efforts in school and public to strengthen the moral resistance 
against corruption (increase of A and decrease of a) it seems a good strategy 
to make administrative and political decision hierarchies as independent and 
decentralized as possible to avoid high clustering. 

We would like to end these short remarks by a few comments on the role 
of hubs - the very high degree vertices typically present in scale free graphs - 
in corruption dynamics. While a priori not especially well suited to transmit 
corruption via the a— process due to the local tree like structure around the 
hubs (compared with low degree vertices) they nevertheless are more often 
exposed to corruption and have therefore a higher probability to get corrupt. 
If the hub density is sufficiently high (as is the case for scale-free degree 
distributions with exponent A < 3) and the degree correlation is stronger 
than additive many vertices are linked to the hubs via social dependencies 
and in turn also can get corrupt. Furthermore they may play a fatal role 
in increasing the weighted corruption density relevant for the mean field 
process as was explained at the end of section 7. The described situation 
is probably typical for strongly hierarchically organized countries or regional 
substructures e.g. systems with a dictatorial or monarchical tendency. In 
such societies a high prevalence of corruption seems almost unavoidable since 
the threshold is close to zero. For democratic societies it seems therefore 
wise, to watch the behavior of hubs- whatever their social interpretation 
might be- more intensively than the "normal" part of the society. 

10 Summary and perspectives 

In this article we have presented a first study of the spread of corruption 
on scale free and highly clustered networks. One of the main observations 
so far is the strong dependence of the asymptotic dynamics on the initial 
number of corrupt individuals. This holds as well for the mean field process 
as for the local dynamics. Second there is a fatal resonance effect between 
global and local dynamics lowering dramatically the critical density of initial 
infection. As expected there is a positive correlation between clustering and 
spread of corruption respectively the critical initial density. Scale-freenes 
seems to play an important role for the corruption process for distributions 
with small exponent (A < 3) and multiplicative degree correlation due to 
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the high prevalence of infected hubs and the strong linkage of medium and 
low degree vertices to them. For higher exponents the dynamics is rather 
insensitive to the degree distribution. The strength of the degree correlation 
(from weak - additive till strong - multiplicative or even higher powers) in 
networks of social acquaintances seems to be related to the political and 
institutional structure of a society which favours liberal organization forms 
as being less vulnerable to corruption. 

There is a whole bunch of natural continuations or generalizations which 
have to be investigated next. Clearly a deeper understanding of the pure a— 
process and its phase transitions is necessary. The mathematical problem is 
already highly nontrivial on trees. The following short list gives a selection 
of natural generalizations and refinements: 

- quenched disorder in all parameters 

- inclusion of geographical or regional structure into the network 

- inclusion of administrative or political substructures in which corruption 

typically will be realized 

- evolving networks 

- interaction between the corruption process and the network structure 

- more heterogeneity in the social networks e.g. by incorporating family 

like structures or social profiles 

- weighted networks 

- refined transition rules e.g. asymmetry between infecting and getting 

infected 

- different kinds and strength of corruption and their interplay 

- economic impacts in a virtual population. 

Besides the specific context of corruption dynamics there is a multitude 
of topics where the model presented in this paper could easily be adopted 
to. This includes so different themes as political opinion formation, social 
disorder processes, strategies for advertisement, doping usage, the spread 
of prejudices, migration dynamics, global terrorist networks and innovation 
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processes. In all these examples one has a local and global dynamics very 
similar to the one described here. Of course there are differences. For instance 
in many mind formation problems the state space of individuals is rather 
complex and the local dynamics allows for many transitions not just 0-1 as in 
the corruption model. Furthermore aging phenomena and limits of resources 
could be included. But besides this addition of structure and complexity 
and the various interpretations there remains a good part of the findings of 
this work to be true. There will be phase transitions in the initial density of 
certain properties and there can be resonance effects between the nonlinear 
global and local dynamics - both making the prediction of future difficult 
and challenging. 
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